Longest increasing subsequence as expectation of a simple nonlinear stochastic PDE 

with a low noise intensity 
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We report some new observation concerning the statistics of Longest Increasing Subsequences 
(LIS). We show that the expectation of LIS, its variance, and apparently the full distribution func- 
tion appears in statistical analysis of some simple nonlinear stochastic partial differential equation 
(SPDE) in the limit of very low noise intensity. 



I. INTRODUCTION 



Last decade is marked by a breakthrough in the solu- 
tion of a classical probabilistic problem, known as " Ulam 
problem" discussed in the mathematical literature over 
the years [ij. The general setting of the Ulam problem is 
as follows. Take the unit interval [0, 1] and pick up from it 
one after another N random numbers {N ^ 1) with uni- 
form probability distribution. Having the sequence of N 
random numbers, one can extract from it the longest in- 
creasing subsequence (LIS) of k elements. The entries of 
this subsequence are not obliged to be the nearest neigh- 
bors. There are two basic questions of interest: i) what 
is the expectation {k{N)) of LIS; and ii) what are the 
fluctuations of the mean length of LIS. The first question 
has been answered in 1975 by Vershik and Kerov who 
have shown that {k{N)) = for iV > 1. They de- 

rived this result by mapping the expectation of the LIS 
to the expectation of the first row of ensemble of Young 
tableaux over the Plancherel measure. If the initial ran- 
dom sequence contains repeated numbers, then the first 
line of the Young tableau corresponds to the longest non- 
decreasing subsequence. 

The progress in treatment of fluctuations of LIS was 
the subject of recent investigations [1, 13, IE fl S Hi- 
In these works an exact asymptotic form of a prop- 
erly normalized full limiting distribution has been estab- 
lished (the so-called Tracy- Widom distribution) of the 
first row of the Young tableau over the Plancherel mea- 
sure by mapping to the distribution of the largest eigen- 
values of some classes of random matrices [3|. In par- 
ticular it has been shown that the properly normalized 
variance of LIS has for N ^ 1 the asymptotic scaling 
form Va.T[k{N)] = (k'^iN)) - {k{N)f = cN^/^, where 
c={[x - (x)]^) = 0.8132... and x has the Tracy-Widom 
distribution for the Gaussian Unitary Ensemble. 



We would like also to point out a deep connection be- 
tween the distributions of edge states of random ma- 
trices and of statistical characteristics of some random 
growth models. In one of the most stimulating work 
it has been shown that a (1-f l)-dimensional model of 
directed polymers in random environment, which is in 
the KPZ universality class, has the Tracy-Widom dis- 
tribution for the scaled height (energy). Around the 
same time the authors of @] have found an exact map- 
ping between a specific polynuclear growth (PNG) model 
and the LIS problem. The same Tracy-Widom distribu- 
tion was reported in another class of (l+l)-dimensional 
growth models called " oriented digital boiling" model 0] . 

In our work we report some new observation con- 
cerning the statistics of longest increasing subsequences. 
Namely, we show that the expectation of LIS, its vari- 
ance, and apparently the full distribution function ap- 
pears in a statistical analysis of some properly scaled 
simple nonlinear stochastic partial differential equation 
(SPDE) in a limit of very low noise intensity. 

The paper is organized as follows. In Section [ll] we 
introduce all the necessary definitions. In particular, we 
describe LIS as the hight profile of some uniform dis- 
crete growth problem with nonlocal long-ranged inter- 
actions. In Section |TTT] we consider the discrete short- 
ranged asymmetric uniform growth process and derive 
the corresponding continuous space-time SPDE for the 
height profile. In Section [TVl we show that the properly 
scaled limit of an infinitely small noise in the derived 
SPDE adequately describes the growth with long-ranged 
interactions, i.e. the statistics of LIS. The discussion and 
conclusions are collected in Section FVl 



II. LIS AS A HEIGHT PROFILE IN A 
UNIFORM ASYMMETRIC NON-LOCAL 
GROWTH PROCESS 
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The standard construction of a Young tableau for 
the set of real numbers is realized via the well known 
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Robinson~Schensted~Knuth algorithm (RSK) which can 
be found in many textbook on representation theory, and 
for example, in [13] ■ The RSK algorithm ensures that 
the first row of the corresponding Young tableau would 
be the Longest Increasing Subsequence (LIS) for a given 
set of numbers. To be specific, consider the example 
of = 8 numbers: 4,4,3,6,7,3,4,2 taken at random 
with uniform probability distribution from the support 
{!,... ,9}. Following the RSK algorithm of the Young 
tableau construction, we get the Young tableau shown 
in Figllji. Let us describe now more geometrically obvi- 
ous construction of LIS which refers to the discrete uni- 
form asymmetric ballistic deposition of elementary cells 
with long-ranged interactions. This construction has ap- 
peared for the first time in 

Consider the (1+1)D model of ballistic deposition in 
which the columnar growth occurs sequentially on a lin- 
ear substrate consisting of L columns with free boundary 
conditions. The time t is discrete and increases by 1 with 
every deposition event. We start with the flat initial con- 
dition, i.e., an empty substrate at t = 0. At any stage 
of the growth, a column (say the column m) is chosen at 
random with probability P — and a " cell" is deposited 
there which increases the height of this column by one 
unit: hm ^ hm + 1. Once this "cell" is deposited, it 
screens all the sites at the same level in all the columns 
to its right from future deposition, i.e. the heights at 
all the columns to the right of the column ni must be 
strictly greater than or equal to hm -I- 1 at all subsequent 
times. Formally such a growth is implemented by the 
following update rule. If the site m is chosen at time t 
for deposition, then 

= max{;i„(t),/i„_i(i),...,/ii(i)} + l (1) 

The model is anisotropic and long-ranged and evidently 
even the average height profile {hm{i)) depends non- 
trivially on both the column number k and time t. 
Let us demonstrate now the a bijection between the 
longest nondecreasing subsequence in the sequence of 
N random numbers uniformly taken from the support 
{1,2,3,...L} and the height in the uniformly growing 
heap with anisotropic infinite-ranged interactions in a 
bounding box containing L columns. 

This bijection is defined by assigning the first line in 
the Young tableau to the "most top" (or "visible" from 
the left-hand side) blocks — see the Fig[T]3. For the con- 
figuration shown in Fig[T]we have the sequence of = 8 
numbers: 4, 4, 3, 6, 7, 3, 4, 2 taken at random from the set 
{1,...,9} (there are L = 9 columns in the box). The 
"visible" blocks define the longest nondecreasing subse- 
quence (LNS): 2,3,4,7. To be exact, if there are few 
LNSs, our construction extracts one of them - exactly as 
in the RSK scheme. 

As one can see in FigdJ the first line in the Young 
tableau H]) exactly matches the " most top" subsequence 
defined in our model and the distribution of the maximal 
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FIG. 1: (a) Young tableau; (b) Asymmetric Ballistic Deposi- 
tion. 



height of a heap coincides with the distribution of the 
longest nondecreasing subsequence. 



III. SHORT-RANGED ASYMMETRIC 
BALLISTIC GROWTH AND ITS CONTINUOUS 
LIMIT 



Consider a one-dimensional discrete model of ballistic 
deposition with asymmetric (one-sided) next-nearest- 
neighboring (NNN) interactions 

hj{t + 1) = max{/ij(i), h.j^i{t)} + iqj{t) (2a) 

where hj{t) is the height of a surface at the lattice posi- 
tion j and time t (the time is discrete as well) and rjj {t) 
is a telegraph-like uncorrelated noise: 



1 with probability p 
with probability I — p 



(2b) 



where 



=P 

[(Vjitl) - Vjih)^ (?7m(^2) - '7m(i2)) = p{l ~ p) Sj^„^ 6t,^t2 

(2c) 

The bar denotes averaging over the distribution ((2b)) . 
Equations (Pa|) - Pc|) completely describe the updating 
rules for the NNN discrete ballistic deposition with the 
asymmetric interactions shown in Fig (21 



□ 



□ 



□ 



FIG. 2: Updating rules for the asymmetric NNN ballistic de- 
position defined in Eq.((2a|. 
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In order to derive a continuous model corresponding to 
,,Ja|) . We represent the max{...} operator in (|2ap at time 
i by a sign-function sign[z]: 

{hj-i,hj}= (1 + sign[ft.j_i - /ij]) 

+ i/ij (1 + sign [/ij - 



max{ft,,,_ 



where 



sign[z] 



-1 for z > 
-1 for z < 



Substituting ^ into (Pa|) we get after some simple alge- 
bra 

hj{t + 1) - hj{t) = -^{hj{t) - hj^i{t)) 

+ \{h,{t) - h,^i{t))sign[h,it) - h,.i{t)] + rj.it) 

.(4) 

In the last expression one can easily identify the finite- 
difference derivatives. Denoting the spatial and temporal 
increments by Ax and At and taking into account that 
sign[az] = sign[z] (for a > 0), we arrive at the following 
expression 



dth{x,t) 



-i|f9,/i(:E,i)(l-sign [d^KxM 



h-qix^t) 



(5) 



At 



where dth(x,t) and dxh{x,t) denote the partial deriva- 
tives of h(x,t) with respect to t and x correspondingly. 
Setting in (O At = Ax = 1 we arrive at the nonlinear 
stochastic partial differential equation (SPDE) which is 
a continuous analog of the discrete Asymmetric Ballistic 
Deposition described by (Pa|) . The noise in ([5]) is still 
defied by ((2bl) ^((2cl). 



It is known 12] that symmetric Ballistic Deposition 
model with NNN interactions in the continuous limit has 
some relevance to KPZ equation 13]. For better compar- 
ison of the KPZ equation with our SPDE one, rewrite ^ 
as follows: 

dth{x,t) = i \dxh{x,t) \ - ^dxh{x,t) +Tj{x,t) (6a) 



or, equivalently. 



dth{x,t) 



r]{x,t) — dxh{x,t) if dxh{x,t) < 
■q{x,t) if dxh{x,t)>0 



(6b) 

with ri{x,t) given by (|2bp - (Pc)) . One sees that the 
stochastic equation (j6ap does not contain a diffusion term 
and is very different from the KPZ one. However, as will 
be shown below, the expectation {h{x,t)) of Ea. (pa|) con- 
verges for t 1 and a; ^ 1 to the expectation of LIS 
for a noise ri{x, t) with very small intensity p {p < L^"^). 
The same is valid for the variance, Var[/i(a;, t)], which 
demonstrates the KPZ scaling behavior. All that allows 
us to conjecture that the full distribution of h{x, t) for 



small p converges to a Tracy- Widom distribution of LIS. 
Let us note that the structure of Eq. ([Hb]) " ideologically" 
resembles the structure of the one-dimensional Baren- 
blatt model for the field u{x,t) with different diffusion 
constants for dtu{x,t) > and dtu{x,t) < [T3 ]. 



IV. SPDE IN THE INFINITELY RARE NOISE 
REGIME AND LIS 



In this Section we show that the limit of a telegraph- 
like noise with small intensity in the stochastic partial 
differential equation ((6a|) for the "height" function h{x,t) 
has LIS statistics. To do that, it is convenient to describe 
LIS in terms of (H-l)D Ballistic Deposition (BD) set by 
Eq.©. 

Let us begin with the limiting case of absence of any 
noise in (|6ap . The noiseless equation (|6aP is invariant 
under the scaling transformation h{x, t) h{ax, at) for 
any a. Rewriting (j6ap in a finite-difference form on the 
lattice, we have: 



h{x,t + At) = h{x,t) +^At 



h(x,t)-h{x~Ax,t) 
Ax 



1 A , / h{x.t)~h(x- 
'2'^''\ Ax 



Ax,t) 



(J) 



We can set At = Ax = 1 and x ~ j on the interval 
< j < L without any loss of generality. The noise- 
less system defined by Eq.([7]) has a characteristic time 
scale r ~ L of reaching the equilibrium. Hence, after t 
time steps, where t > r, the system described by Eq.® 
reaches its stationary state generating a nondecreasing 
staircase-like profile from any initial state. This can be 
seen recursively: 



h{j, t+l) = h{j~l, t) if h{j - 1, t) > h{j, t) 
h(j, t + 1) = h{j, t) if h{j - 1, t) < h{j, t) 



(8) 



Since ^ is valid for any < j < L, after t ^ L time 
steps, if in the initial state h{j ~ L,t ^ Q) < h{j — 
1, t = 0) then the heights a,t x — L and x — I equalize: 
h{j = L,t) = h(j — l,t). This signals the appearance of 
effective long-ranged spatial correlations in the system 
on characteristic time scales of r ^ L. 

To show the equivalence of SPDE and LIS in a noisy 
regime with small intensity, it is convenient to introduce 
the enveloping functions rm{t) and r{x, t) for long-ranged 
(LR) BD and SPDE correspondingly: 



r„(t) = max{ft.„(t), . . . 
r{x,t) = max {h(y,t)} 

0<y<x 



,hi{t)} LRBD (Eq.©) 
SPDE 



(9) 

If the intensity, p, of the noise ri{x,t) is small, after each 
noise event the function h{x^ t) has time to reach its re- 
laxed form r[x,t) before the subsequent noise contribu- 
tion is added. The corresponding intensity, p, can be 
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easily estimated. Namely, the mean time interval, r, be- 
tween positive noise events {r]{x,t) — 1) for a system of 
size L and noise intensity pisr ^ In more details this 
question is discussed in the Appendix. Since the function 
h(x, t) reaches the relaxed state r(x, t) at t > L, we can 
estimate p a.s p < L^^. Now Eqs.([T]) and ([6a|) can be 
rewritten in the unified form: 

J h,n{t + 1) = r,„(t) + I long-ranged BD Eq.© 

\h{x,t + At) = r{x,t)+r]{x,t) SPDE with p < L^^ 

(10) 

In this form the equivalence between long-ranged BD 
and SPDE in a low-intensity noise regime is clearly seen. 

Thus, relying on the equivalence between statistics of 
LIS and low-intensity solutions of SPDE (pa|) . and know- 
ing explicitly all the moments of the distribution of LIS 
(see, for example, [5|), we get the following asymptotic 
expressions for the expectation, {h{x, t)) , and for the vari- 



BD and SPDE is very good for the expectation and the 
variance — see Figl2^,b. 



YaT[hix,t)] = (/i(.x,t)2) - {h{x,t)Y 

{h{x,t)) = tp + 2^/xtp 
Ya.T[h{x,t)] = c{xtpy/^ 



(11) 
(12) 



where h{x, t) is the asymptotic solution of the equa- 
tion ([6a|) with the noise distribution (l2c|) . In (fT2|) c = 
{[x-{x)?) = 0.8132... and x has the Tracy-Widom 
distribution for Gaussian Unitary Ensemble. Note that 
for p = i and x = t = L {L ^ 1) wc arrive in (fTT|) 
at the Vershik-Kerov result for the expectation of LIS, 
{h{L)) = 2Vl. To be rigorous, this expression is beyond 
our approximation since we are limited by the noise in- 
tensity Per — -j^- So, it is more convenient to work with 
the general expressions ([TT1) -(fT2 |) . 

We confirm our analytic predictions (fTT|) -([T2 |) by nu- 
merical computations, simulating directly the ballistic 
deposition process H]) and the stochastic partial differ- 



ential equation (j6a[) with noise intensity P — and 
initial condition h{x,t = 0) = 0. Note that the in- 
tensity of the noise in our numerical simulations is ^ 
times larger than the upper estimate p ^ however 
the agreement with theoretical results PT|) - P^ is still 
very good. In particular, we plot in FiglSK the aver- 
age profiles {h{x,t)) for long-ranged BD and SPDE as a 
function of a: (0 < a; < L) for L = 10'^ at time points 
t = 2 X 10'', 6 X 10^,10^. In the same figure we plot 
the analytic expression ITTI) . In FiglHJs we plot the vari- 
ance V'dr{h{x,n)) = {h{x,tf) - {h{x,t)Y function 
of X for the same time points t. It should be noted 
that due to the numerical procedure of the definition 
of the height, the numerical computation of the vari- 
ance leads to the following interpolating expression for 
Var[/i(x, t)] = a{t) + cixptY^'^. However for a; 1 and 
t = const we arrive asymptotically at the expression (fT2)) . 
That is why we compare the expression (jl2l) with the nu- 
merical data for large x only where the term a(t) is neg- 
ligible. Moreover, the agreement between long-ranged 
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FIG. 3: (a) The mean profile {h{x,t)) as a function of x for 
time points f = 2 x 10^,6 x 10*, 10^ for BD and SPDE in 
comparison witli the analytical results (|11[) : (b) The variance 
Var(/i(a;, t)) as a function of x at time points t = 2 x 10*, 6 x 
10*, 10^ for BD and SPDE in comparison with the analytic 
result 1121). The length of the system here is L = 1000. 



CONCLUSION 



We have shown by the sequence of mappings that 
the expectation and the variance of the random profile 
h(x,t), described by the stochastic nonlinear partial dif- 
ferential equation (|6ap in the limit of a noise with very 
low intensity, coincides with the expectation (|lip and the 
variance ([T^ of the longest nondecreasing subsequence of 
the sequence of random integers. This statement is also 
confirmed numerically. On the basis of the obtained re- 
sults we conjecture that not only the first moments, but 
the full probability distribution function of the random 
variable h{x,t) coincides with the Tracy-Widom distri- 
bution appearing in largest eigenvalue statistics of en- 
sembles of random matrices. 



Let us note that our analysis of Eq. ([6a]) is a bit indl 
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rect. It would be very desirable to get expectation and 
variance pT|) - p^ of the random variable h{x, t) directly 
from the solution of the Fokker-Planck equation which 
corresponds to the Langevin equation (|6ap . However on 
this way we have met some difficulties. Namely, rewrite 
Eq. ([6a|) as follows 



dth{x,t) = i \dxh{x,t)\ 



1 



dMx,t)+£,{x,t)+p (13) 



where S,{x,t) = ri{x,t) - p; £,{x,t) = 0; £,{x,t)(,{x' ,t') ^ 
2D6xy6t^t' and D = ^p{l—p). Now we can write the for- 
mal expression for the Fokker-Planck variational equa- 
tion for the function W{h,t) (see, for example, [Tst): 

-Idxl{{^\d.h\~ld.h + p)Wih,t)} 
+Djdxi^W{h,t) 

(14) 

which in turn can be rearranged in a form of two coupled 
equations: 



awjh.t) 

dt 



JW{h,t] 
Sh 



+ ^—sfP — 



/ dx (^—p- 

(Jdxd^h'-^ iid^h<Q (15) 
1 ■■dd^h>Q 



where we have taken into account that jy^[dxh{x, t)] 



0. 



The Fokker-Planck equation p5|l corresponds to the 
stochastic process (|6b[) which can be visualized alterna- 
tively as a (2-f l)-dimensional correlated growth. Using 
the finite-difference form ^ of (I6bp on the lattice, where 
we set Ax = Ai = 1 and a; = j, we arrive at the following 
stochastic recursion relation 

h{x, i + 1) = h{x., t) + ri{x, t) 

{h{x, t) - h{x - 1, t) if h{x, t) < h{x - 1, t) 
\ih{x,t)>h{x-l,t) 

(16) 

with the initial and boundary conditions h{x, t — Q) = 
h(x — 0,t) = 0. Create now the initial configuration 
distributing the random variable 77(0;, t) in the plane x > 
l,t> 1. Since r]{x, t) takes the values or 1, we can show 
7y = 1 by black unit segments as it is shown in FiglH The 
initial configuration at t = 1 is depicted in Figl4^. Now 
applying the equation we can constrict recursively 
the configuration of the field h(x,t) at subsequent time 
moments. Few configurations of the field h(x, t) for given 
initial distribution of 77(2;, t) is shown in FigdJa-d for time 
moments t = 2, 3, 4 and a; = 1, ...5. 

The (2-|-l)-dimensional stochastic growth shown in 
FigUlbijectively corresponds to the short-ranged ballistic 
growth considered in Section [IIII and, hence, in the limit 
of small intensity p should correspond to the statistics of 
LIS. It would be very desirable to simulate numerically 
directly the growth model shown in Fig[5] to check the 
validity of the conjectured bijection. 
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FIG. 4: Visualization of the stochastic growth determined by 
the recursion relation p6p for a; = 1, 5 and for several time 
slices t = l (a), t = 2 (b), t = 3 (c), t = 4 (d). 
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APPENDIX A: EQUILIBRATION TIME FOR 
SPDE 



In the limit of small p (p <C 1) the actual mean profile 
{h{x, t)) tends to the curve y(x, t) = tp+2y/xtp. However 
for larger values of p (p ~ 1) the resulting profile does not 
have enough time to relax and hence is displaced below 
the curve y(x,t). The question which we address here 
concerns the estimation of the time interval, t, between 
the subsequent deposition events sufficient to approach 
the stable solution y{x,t). 

We can estimate the equilibration time, t, from the 
following arguments. The slope of the mean profile in 
the stationary state is 



dy{x,t) 
dx 



(Al) 



Hence, the average length of the horizonal "plateau" is 
about 



(A2) 




9y{x,t) 
dx 



We define the equilibration time, t, as a time t in (|A2|) 
during which the length l{x), of the plateau becomes that 
of the order of the system size, L. Substituting for x in 
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(IA2p the maximal value x = L, we arrive at the following 
estimate 




what gives us 

1 



Comparing the equilibration time (jA3p with the charac- 
teristic time of the system relaxation, r ~ L, we arrive 
at the estimate for the critical noise intensity, pcr, 

1 

below which the stochastic partial differential equation 
([6a|) has long-ranged behavior typical for the LIS prob- 
lem. 
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